Manulex-infra: distributional characteristics of grapheme-phoneme mappings, and infralexical and lexical units in child-directed written material.
Abstract
It is well known that the statistical characteristics of a language, such as word frequency or the consistency of the relationships between orthography and phonology, influence literacy acquisition. Accordingly, linguistic databases play a central role by compiling quantitative and objective estimates of the principal variables that affect reading and writing acquisition. We describe a new set of Web-accessible databases of French orthography whose main characteristic is that they are based on frequency analyses of words occurring in reading books used in the elementary school grades. Quantitative estimates were made for several infralexical variables (syllables, grapheme-to-phoneme mappings, bigrams) and lexical variables (lexical neighborhood, homophony, and homography). These analyses should permit quantitative descriptions of the written language encountered by beginning readers, the manipulation and control of variables based on objective data in empirical studies, and the development of instructional methods in keeping with the distributional characteristics of the orthography.
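To make the infralexical variables concrete, here is a minimal Python sketch of how frequency-weighted grapheme-to-phoneme counts, a consistency ratio, and bigram frequencies could be computed from a word list. It is not the authors' procedure: the toy lexicon, its frequencies, the phoneme symbols ("#" marks a silent letter), and the function names are all hypothetical, and consistency is assumed here to mean the share of a grapheme's frequency-weighted occurrences that map to a given phoneme.

```python
from collections import Counter, defaultdict

# Hypothetical toy lexicon: (word, frequency, [(grapheme, phoneme), ...]).
# Frequencies and segmentations are invented for illustration only;
# "#" stands for a silent letter.
TOY_LEXICON = [
    ("chat", 120, [("ch", "S"), ("a", "a"), ("t", "#")]),
    ("chez", 80, [("ch", "S"), ("e", "e"), ("z", "#")]),
    ("chorale", 5, [("ch", "k"), ("o", "o"), ("r", "R"),
                    ("a", "a"), ("l", "l"), ("e", "#")]),
]


def mapping_frequencies(lexicon):
    """Sum word frequencies for every grapheme -> phoneme mapping."""
    counts = defaultdict(Counter)
    for _word, freq, pairs in lexicon:
        for grapheme, phoneme in pairs:
            counts[grapheme][phoneme] += freq
    return counts


def consistency(counts, grapheme, phoneme):
    """Share of a grapheme's weighted occurrences pronounced as `phoneme`."""
    by_phoneme = counts.get(grapheme, Counter())
    total = sum(by_phoneme.values())
    return by_phoneme[phoneme] / total if total else 0.0


def bigram_frequencies(lexicon):
    """Sum word frequencies for every pair of adjacent letters."""
    bigrams = Counter()
    for word, freq, _pairs in lexicon:
        for i in range(len(word) - 1):
            bigrams[word[i:i + 2]] += freq
    return bigrams


counts = mapping_frequencies(TOY_LEXICON)
bigrams = bigram_frequencies(TOY_LEXICON)
print(dict(counts["ch"]))
print(round(consistency(counts, "ch", "S"), 3))
print(bigrams["ch"])
```

Weighting by word frequency rather than counting each word once is a design choice in this sketch: it lets frequent words dominate the estimate, which is one way to reflect how often a beginning reader would actually meet each mapping.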
Similar resources
Approximating Phonotactic Input in Children's Linguistic Environments from Orthographic Transcripts
Child-directed spoken data is the ideal source of support for claims about children’s linguistic environments. However, phonological transcriptions of child-directed speech are scarce, compared to sources like adult-directed speech or text data. Acquiring reliable descriptions of children’s phonological environments from more readily accessible sources would mean considerable savings of time an...
Improving grapheme-based ASR by probabilistic lexical modeling approach
There is growing interest in using graphemes as subword units, especially in the context of the rapid development of hidden Markov model (HMM) based automatic speech recognition (ASR) system, as it eliminates the need to build a phoneme pronunciation lexicon. However, directly modeling the relationship between acoustic feature observations and grapheme states may not be always trivial. It usual...
Training grapheme to phoneme conversion in patients with oral reading and naming deficits: A model-based approach
A model-based treatment focused on improving grapheme to phoneme conversion as well as phoneme to grapheme conversion was implemented to train oral reading skills in two patients with severe oral reading and naming deficits. Initial assessment based on current cognitive neuropsychological models of naming indicated a deficit in the phonological output lexicon and in grapheme to phoneme conversi...
Hemispheric specialization for visual words is shaped by attention to sublexical units during initial learning
Selective attention to grapheme-phoneme mappings during learning can impact the circuitry subsequently recruited during reading. Here we trained literate adults to read two novel scripts of glyph words containing embedded letters under different instructions. For one script, learners linked each embedded letter to its corresponding sound within the word (grapheme-phoneme focus); for the other, ...
Using Phoneme Distributions to Discover Words and Lexical Categories in Unsegmented Speech
When learning language young children are faced with many formidable challenges, including discovering words embedded in a continuous stream of sounds and determining what role these words play in syntactic constructions. We suggest that knowledge of phoneme distributions may play a crucial part in helping children segment words and determining their lexical category. We performed a two-step an...
Journal: Behavior Research Methods
Volume 39, Issue 3
Publication date: 2007